Versions:

3.0.5

vibe is an open-source desktop transcription utility developed by thewh1teagle that converts audio and video files into text entirely offline through a graphical interface powered by OpenAI Whisper. Designed for journalists, students, podcasters, researchers, and anyone who needs accurate speech-to-text conversion without cloud dependencies, the program accepts common media formats such as MP3, WAV, M4A, MP4, MOV, and MKV, processing them locally to produce time-stamped or plain transcripts while keeping sensitive recordings on the user’s machine. The software, currently at version 3.0.5 and offered in a single edition, leverages the Whisper model’s multilingual capabilities to recognize dozens of languages and dialects, automatically detecting the spoken tongue and offering optional subtitle export in SRT or VTT for video editing workflows. Because all computation runs on the CPU or optional GPU acceleration, no Internet connection or API key is required after the initial model download, ensuring privacy and eliminating recurring costs. Typical use cases include generating interview transcripts for media production, creating accessible captions for lectures, converting meeting recordings into searchable notes, and archiving personal voice memos. The lightweight application presents a minimal drag-and-drop window where users select audio quality presets—ranging from “tiny” for speed to “large” for maximum accuracy—and receive progress feedback before saving results in TXT, JSON, or subtitle formats. vibe is available for free on get.nero.com, with downloads provided via trusted Windows package sources (e.g. winget), always delivering the latest version, and supporting batch installation of multiple applications.

Tags:

ai 171

cross-platform 169

desktop 66

openai 21

rust 177

transcribe 10

whisper 5